semantic mask
1 Hosting, Licensing, and Maintenance Plan
The dataset will be available for a minimum of five years, with no plans for removal. We will ensure ongoing maintenance to verify and preserve data accessibility. For what purpose was the dataset created? Was there a specific task in mind? Who created the dataset (e.g., which team or research group), and on behalf of which entity? Who funded the creation of the dataset?
- North America > United States > Illinois > Cook County > Chicago (0.05)
- North America > United States > California > San Francisco County > San Francisco (0.05)
- North America > United States > California > Los Angeles County > Los Angeles (0.05)
- Law (0.68)
- Information Technology > Security & Privacy (0.68)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- North America > United States > California (0.04)
- Asia > China > Tianjin Province > Tianjin (0.04)
- Asia > China > Beijing > Beijing (0.04)
- Information Technology > Sensing and Signal Processing > Image Processing (1.00)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Foundation models have achieved remarkable results in 2D and language tasks such as image segmentation, object detection, and vision-language understanding. However, their potential to enrich 3D scene representation learning remains largely untapped due to the domain gap. In this work, we propose Bridge3D, a methodology that addresses this gap by pre-training 3D models using features, semantic masks, and captions sourced from foundation models. Specifically, our method employs semantic masks from foundation models to guide the masking and reconstruction process of the masked autoencoder, enabling more focused attention on foreground representations.
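As a rough illustration of the idea of mask-guided masking, the sketch below biases an autoencoder's patch-masking step toward foreground patches identified by a foundation-model semantic mask. The function name, the `fg_bias` parameter, and the weighted-sampling scheme are hypothetical illustrations, not Bridge3D's actual implementation:

```python
import numpy as np

def mask_guided_sampling(fg_mask, mask_ratio=0.6, fg_bias=0.8, rng=None):
    """Choose which patches to mask, biased toward foreground patches.

    fg_mask: boolean array of shape (num_patches,), True where a
             foundation-model semantic mask marks foreground.
    Returns a boolean array: True = patch is masked (to be reconstructed).
    """
    rng = np.random.default_rng() if rng is None else rng
    n = fg_mask.shape[0]
    n_masked = int(round(mask_ratio * n))
    # Sampling weights: foreground patches are more likely to be masked,
    # so the autoencoder must reconstruct object regions, not background.
    weights = np.where(fg_mask, fg_bias, 1.0 - fg_bias)
    weights = weights / weights.sum()
    masked_idx = rng.choice(n, size=n_masked, replace=False, p=weights)
    out = np.zeros(n, dtype=bool)
    out[masked_idx] = True
    return out
```

With a high `fg_bias`, most of the reconstruction budget lands on object regions, which is the intuition behind focusing attention on foreground representations.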
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
Semantic segmentation has witnessed tremendous progress thanks to various advanced network architectures. However, these architectures demand large amounts of finely annotated data for training, and acquiring such annotations is laborious and costly. We therefore present FreeMask, which resorts to synthetic images from generative models to ease the burden of both data collection and annotation. Concretely, we first synthesize abundant training images conditioned on the semantic masks provided by realistic datasets.
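The key economy here is that a synthetic image generated from a real mask inherits that mask as a free dense annotation. A minimal sketch of assembling such a mixed training set, assuming the synthetic images have already been generated elsewhere (the function name and data layout are illustrative, not FreeMask's API):

```python
def mix_real_synthetic(real_pairs, synth_per_mask):
    """Build a combined training set: each real (image, mask) pair is kept,
    and synthetic images generated for the same mask reuse that mask as
    their dense annotation, so no extra labelling is needed.

    real_pairs: list of (image, mask) tuples
    synth_per_mask: dict mapping real-pair index -> list of synthetic images
    """
    combined = []
    for i, (img, mask) in enumerate(real_pairs):
        combined.append((img, mask))
        for synth_img in synth_per_mask.get(i, []):
            combined.append((synth_img, mask))  # mask is free supervision
    return combined
```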
SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion
Zhang, Xiaoyang, Li, Jinjiang, Fan, Guodong, Ju, Yakun, Fan, Linwei, Liu, Jun, Kot, Alex C.
Infrared and visible image fusion (IVIF) aims to combine the thermal radiation information from infrared images with the rich texture details from visible images to enhance perceptual capabilities for downstream visual tasks. However, existing methods often fail to preserve key targets due to a lack of deep semantic understanding of the scene, while the fusion process itself can also introduce artifacts and detail loss, severely compromising both image quality and task performance. To address these issues, this paper proposes SGDFuse, a conditional diffusion model guided by the Segment Anything Model (SAM), to achieve high-fidelity and semantically-aware image fusion. The core of our method is to utilize high-quality semantic masks generated by SAM as explicit priors to guide the optimization of the fusion process via a conditional diffusion model. Specifically, the framework operates in a two-stage process: it first performs a preliminary fusion of multi-modal features, and then utilizes the semantic masks from SAM jointly with the preliminary fused image as a condition to drive the diffusion model's coarse-to-fine denoising generation. This ensures the fusion process not only has explicit semantic directionality but also guarantees the high fidelity of the final result. Extensive experiments demonstrate that SGDFuse achieves state-of-the-art performance in both subjective and objective evaluations, as well as in its adaptability to downstream tasks, providing a powerful solution to the core challenges in image fusion. The code of SGDFuse is available at https://github.com/boshizhang123/SGDFuse.
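The conditioning step described above — feeding both the preliminary fused image and the SAM mask into the diffusion model — can be sketched as a simple channel-stacking operation. This is an assumption about how such a condition tensor might be assembled, not SGDFuse's actual code; the normalization scheme is illustrative:

```python
import numpy as np

def build_condition(prelim_fused, sam_mask):
    """Channel-stack the preliminary fused image with the SAM mask so the
    denoiser sees both the coarse fusion result and explicit semantics.

    prelim_fused: (H, W) grayscale fused image in [0, 1]
    sam_mask:     (H, W) integer segment ids from SAM
    """
    mask_chan = sam_mask.astype(np.float32)
    if mask_chan.max() > 0:
        mask_chan /= mask_chan.max()  # normalise segment ids to [0, 1]
    return np.stack([prelim_fused.astype(np.float32), mask_chan], axis=0)
```

At each denoising step, the diffusion model would receive this condition tensor alongside the noisy sample, giving the generation process the explicit semantic directionality the abstract refers to.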
- Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
- Asia > China > Shandong Province > Yantai (0.04)
- Asia > China > Fujian Province > Fuzhou (0.04)
- (2 more...)
End-to-end Autonomous Vehicle Following System using Monocular Fisheye Camera
Zhang, Jiale, Qian, Yeqiang, Qin, Tong, Jiang, Mingyang, Chen, Siyuan, Yang, Ming
The increase in vehicle ownership has led to greater traffic congestion, more accidents, and higher carbon emissions. Vehicle platooning is a promising way to address these issues by improving road capacity and reducing fuel consumption. However, existing platooning systems rely on lane markings and expensive high-precision sensors, which limits their general applicability. To address these issues, we propose a vehicle following framework that extends operation from restricted scenarios to general ones using only a camera. This is achieved through our newly proposed end-to-end method, which improves overall driving performance. The method incorporates a semantic mask to mitigate causal confusion in multi-frame data fusion. Additionally, we introduce a dynamic sampling mechanism to precisely track the trajectory of the preceding vehicle. Extensive closed-loop validation in real-world vehicle experiments demonstrates the system's ability to follow vehicles in various scenarios, outperforming traditional multi-stage algorithms and making it a promising, cost-effective solution for autonomous vehicle platooning. A complete real-world vehicle experiment is available at https://youtu.be/zL1bcVb9kqQ.
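One plausible way a semantic mask counters causal confusion in multi-frame fusion is to suppress background in the history frames, so the network cannot latch onto shortcut cues and must attend to the preceding vehicle itself. The sketch below is a hypothetical illustration of that idea under this assumption, not the paper's actual mechanism:

```python
import numpy as np

def mask_history_frames(frames, vehicle_mask):
    """Zero out background in past frames before multi-frame fusion.

    frames:       (T, H, W) stack of past grayscale frames
    vehicle_mask: (H, W) boolean mask of the preceding vehicle

    Only the masked (vehicle) region survives, so the fused features
    depend on the lead vehicle's appearance and motion rather than on
    spurious background correlations (a source of causal confusion).
    """
    return frames * vehicle_mask[None, :, :].astype(frames.dtype)
```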
- Asia > China > Shanghai > Shanghai (0.06)
- North America > United States > California (0.04)
- Asia > China > Shaanxi Province > Xi'an (0.04)
- (2 more...)
- Transportation > Ground > Road (1.00)
- Energy (1.00)
- Transportation > Passenger (0.93)